Embedded memory and methods of forming the same

ABSTRACT

An embedded flash memory device includes a gate stack, and source and drain regions in the semiconductor substrate. The first source and drain regions are on opposite sides of the gate stack. The gate stack includes a bottom dielectric layer over the semiconductor substrate, a charge trapping layer over the bottom dielectric layer, a top dielectric layer over the charge trapping layer, a high-k dielectric layer over the top dielectric layer, and a metal gate over the high-k dielectric layer.

BACKGROUND

Flash memories, which use dielectric trapping layers or floating layers to store charges, are often used in System-On-Chip (SOC) technology, and are formed on the same chip along with other integrated circuits. For example, High-Voltage (HV) circuits, Input/output (IO) circuits, core circuits, and Static Random Access Memory (SRAM) circuits are often integrated on the same chip as the flash memories. The respective flash memories are often referred to as embedded memories since they are embedded in the chip on which other circuits are formed, as compared to the flash memories formed on chips that do not have other circuits. Flash memories have structures different from HV circuit devices, IO circuit devices, core circuit devices, and SRAM circuit devices. Therefore, embedding memory devices with other types of devices faces challenges when the technology evolves.

BRIEF DESCRIPTION OF THE DRAWINGS

For a more complete understanding of the embodiments, and the advantages thereof, reference is now made to the following descriptions taken in conjunction with the accompanying drawings, in which:

FIGS. 1 through 13 are cross-sectional views of intermediate stages in the manufacturing of embedded memory devices and other types of devices in accordance with some exemplary embodiments, wherein a gate-last approach is used; and

FIGS. 14 through 16 are cross-sectional views of intermediate stages in the manufacturing of embedded memory devices and other types of devices in accordance with alternative embodiments, wherein a gate-first approach is used.

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS

The making and using of the embodiments of the disclosure are discussed in detail below. It should be appreciated, however, that the embodiments provide many applicable concepts that can be embodied in a wide variety of specific contexts. The specific embodiments discussed are illustrative, and do not limit the scope of the disclosure.

An embedded memory device and the methods of forming the same are provided in accordance with various exemplary embodiments. The intermediate stages of forming the embedded memory device are illustrated. The variations of the embodiments are discussed. Throughout the various views and illustrative embodiments, like reference numbers are used to designate like elements.

Referring to FIG. 1, semiconductor substrate 20, which is a part of semiconductor wafer 2, is provided. In some embodiments, semiconductor substrate 20 includes crystalline silicon. Other commonly used materials such as carbon, germanium, gallium, boron, arsenic, nitrogen, indium, phosphorus, and/or the like, may also be included in semiconductor substrate 20. Semiconductor substrate 20 may be a bulk substrate or a Semiconductor-On-Insulator (SOI) substrate. In some exemplary embodiments, semiconductor substrate 20 comprises Si_(1-z)Ge_(z), wherein value z is the atomic percentage of germanium in SiGe, and may be any value ranging from, and including, 0 and 1. For example, when value z is 0, semiconductor substrate 20 comprises a crystalline silicon substrate. When value z is 1, semiconductor substrate 20 comprises a crystalline germanium substrate. Substrate 20 may also have a compound structure including a III-V compound semiconductor on a silicon substrate, or a silicon germanium (or germanium) layer on a silicon substrate.

Semiconductor substrate 20 includes portions in regions 100, 200, 300, and 400. In accordance with some embodiments, regions 100, 200, 300, and 400 include an embedded flash memory region, a High-Voltage (HV) region, an Input/output (IO) region, and a Static Random Access Memory (SRAM)/general logic (core) device region, respectively. Embedded flash memory region 100 is used for forming embedded flash memory cells (such as 156 in FIGS. 13 and 16) therein. HV region 200 is used for forming HV devices (such as 256 in FIGS. 13 and 16) therein. IO Region 300 is used for forming IO devices (such as 356 in FIGS. 13 and 16) therein. Core/SRAM Region 400 is used for forming core devices and/or SRAM cells (such as 456 in FIGS. 13 and 16) therein. The core devices, sometimes referred to as logic devices, do not include any memory array therein, and may be, or may not be, in the peripheral region of SRAM arrays. For example, the core devices may be in the driver circuit or the decoder circuit of the SRAM arrays (in region 400) or the memories in region 100. The HV devices are supplied with, and are configured to endure, a positive power supply voltage Vdd1 higher than the positive power supply voltage Vdd2 of the devices in region SRAM/core region 400. For example, power supply voltage Vdd2 may be lower than about 1V, and power supply voltage Vdd1 may be between about 1.5V and about 3.3V. Although portions of substrate 20 in regions 100, 200, 300, and 400 are shown as disconnected, they are portions of the same continuous substrate 20.

As also shown in FIG. 1, bottom dielectric layer 22 is formed on substrate 20. In some embodiments, bottom dielectric layer 22 is formed of silicon oxide, which may be formed by performing a thermal oxidation on substrate 20. In alternative embodiments, bottom dielectric layer 22 comprises silicon oxynitride or other dielectric materials that have low leakage of charges, which may be deposited. In some embodiments, thickness T1 of bottom dielectric layer 22 is between about 20 Å and about 60 Å. It is appreciated, however, that the values recited throughout the description are merely examples, and may be changed to different values.

Charge trapping layer 24 is formed over bottom dielectric layer 22. Charge trapping layer 24 may be a dielectric layer with a high trap density. In some embodiment, charge trapping layer 24 comprises silicon nitride (SiN), which may be deposited using Physical Vapor Deposition (PVD). In other embodiments, charge trapping layer 24 include other materials including, and not limited to, oxides, nitrides, and oxynitrides. For example, charge trapping layer 24 may include AlN, Al₂O₃, HfO₂, HfON, ZrON, or combinations thereof.

Referring to FIG. 2, bottom dielectric layer 22 and charge trapping layer 24 are patterned in an etching step. The portions of bottom dielectric layer 22 and charge trapping layer 24 are removed from regions 200, 300, and 400. The portion of bottom dielectric layer 22 and charge trapping layer 24 in region 100 are left un-removed. After the patterning, as shown in FIG. 3, HV dielectric layer 26 is formed in regions 200, 300, and 400. Thickness T2 of HV dielectric layer 26 may be between about 50 Å and about 300 Å. Top dielectric layer 28 is also formed in region 100 and over charge trapping layer 24. Thickness T3 of top dielectric layer 28 may be between about 60 Å and about 200 Å.

In accordance with some embodiments, HV dielectric layer 26 and top dielectric layer 28 are formed in separate processes. For example, HV dielectric layer 26 is formed using thermal oxidation by oxidizing substrate 20. Accordingly, HV dielectric layer 26 is formed in regions 200, 300, and 400, and not in region 100. Top dielectric layer 28, on the other hand, may be formed using deposition, which may be performed using a Chemical Vapor Deposition (CVD) method such as Plasma Enhance CVD (PECVD), Low Pressure CVD (LPCVD), Atomic Layer Deposition (ALD), or the like. In these embodiments, thickness T2 of HV dielectric layer 26 and thickness T3 of top dielectric layer 28 may be different from each other. The materials of HV dielectric layer 26 and top dielectric layer 28 may also be different from each other or same as each other. With HV dielectric layer 26 and top dielectric layer 28 formed separately, thicknesses T2 and T3 may be adjusted to optimize the performance of the memory device and the HV device formed in regions 100 and 200, respectively. In other embodiments, HV dielectric layer 26 and top dielectric layer 28 are formed simultaneously in a same deposition process. In these embodiments, thickness T2 is equal to thickness T3. By forming HV dielectric layer 26 and top dielectric layer 28 simultaneously, the manufacturing cost may be reduced. HV dielectric layer 26 and top dielectric layer 28 may comprise silicon oxide, silicon oxynitride, or the like. The dielectric constant of the HV dielectric layer 26 and top dielectric layer 28 may be about 3.8 in some embodiments.

As shown in FIG. 4, HV dielectric layer 26 is patterned, and is removed from regions 300 and 400. Next, Referring to FIG. 5, 10 dielectric layer 30 is formed. In some embodiments, IO dielectric layer 30 comprises silicon oxide. Alternatively, IO dielectric layer 30 comprises silicon oxynitride. Thickness T4 of IO dielectric layer 30 may be between about 20 Å and about 70 Å, which may be smaller than thickness T2 of HV dielectric layer 26 in some embodiments. Similarly, IO dielectric layer 30 may be formed through the thermal oxidation of substrate 20, deposition, or the like. After the formation of IO dielectric layer 30, IO dielectric layer 30 is removed from region 400.

Referring to FIG. 6, interfacial layer 32 is formed on substrate 20. Interfacial layer 32 may comprise chemical oxide, a thermal oxide, or the like. In some embodiments, interfacial layer 32 is formed by oxidizing the exposed surface portion of substrate 20. In alternative embodiments, interfacial layer 32 is formed by treating the surface portion of substrate 20 using a chemical, for example, an oxidant such as ozone water or hydrogen peroxide. The resulting interfacial layer 32 is referred to as a chemical oxide layer, which comprises silicon oxide. Thickness T5 of interfacial layer 32 may be between about 5 Å and about 30 Å, which may be smaller than thickness T4 of IO dielectric layer 30 in some embodiments.

Referring to FIG. 7, high-k dielectric layer 34, capping layer 36, and polysilicon layer 38 are formed sequentially, and are formed in regions 100, 200, 300, and 400 simultaneously. Accordingly, each of layers 34, 36, and 38 has a same thickness and a same material in regions 100, 200, 300, and 400. High-k dielectric layer 34 may have a k value greater than about 7.0, and may include an oxide or a silicate of Hf, Al, Zr, La, Mg, Ba, Ti, Pb, Yb, Pr, Nd, Gd, Er, Dy, or combinations thereof. Exemplary materials of high-k dielectric layer 34 include MgO_(x), BaTi_(x)O_(y), BaSr_(x)Ti_(y)O_(z), PbTi_(x)O_(y), PbZr_(x)Ti_(y)O_(z), and the like, with values X, Y, and Z being between 0 and 1. The thickness of high-k dielectric layer 34 may be between about 0.5 nm and about 10 nm. The formation methods of high-k dielectric layer 34 may include Molecular-Beam Deposition (MBD), Atomic Layer Deposition (ALD), Physical Vapor Deposition (PVD), and the like.

Over high-k dielectric layer 34, capping layer 36 may be formed. In some embodiments, capping layer 36 comprises titanium nitride (TiN). In alternative embodiments, the exemplary materials of capping layer 36 include tantalum-containing materials and/or titanium-containing materials such as TaC, TaN, TaAlN, TaSiN, and combinations thereof. Polysilicon layer 38 is then formed over capping layer 36.

FIGS. 8 through 13 illustrate the formation of devices in regions 100, 200, 300, and 400 using a gate-last approach, wherein the gates of the devices are referred to as replacement gates. In these embodiments, polysilicon layer 38 acts as the dummy gate that is replaced by the replacement gates. Referring to FIG. 8, layers 22, 24, 26, 28, 30, 32, 34, 36, and 38 are patterned, forming layer stacks 140, 240, 340, and 440 in regions 100, 200, 300, and 400, respectively. In these embodiments, the top surface 38A of the portion of dummy gate 38 in region 400 is higher than top surface 36A of the portion of capping layer 36 in region 100 to ensure that after the subsequent CMP (FIG. 12), dummy gate 38 remains in each of layer stacks 140, 240, 340, and 440. After the patterning, lightly doped source and drain regions (not shown) and/or packet regions (not shown) may be formed adjacent to either one or all layer stacks 140, 240, 340, and 440.

Next, referring to FIG. 9, gate spacers 42 are formed on the sidewalls of layer stacks 140, 240, 340, and 440. In some embodiments, gate spacers 42 comprise silicon nitride, although other dielectric materials may also be used. The formation of gate spacers 42 includes forming a blanket layer(s), and performing an anisotropic etching to remove the horizontal portions of the blanket layer. The remaining portions of the blanket layer form gate spacers 42.

FIG. 10 illustrates the formation of source and drain regions 44, which are alternatively referred to as a source/drain regions 44 hereinafter. Source/drain regions 44 may be formed through implantation or epitaxy. The formation details of source/drain regions 44 are not discussed herein.

FIG. 11 illustrates the formation of Inter-Layer Dielectric (ILD) 46, which is formed of a dielectric material such as Phospho-Silicate Glass (PSG), Boro-Silicate Glass (BSG), Boron-Doped Phospho-Silicate Glass (BPSG), or the like. ILD 46 has a top surface higher than the top surface of layer stacks 140, 240, 340, and 440. A CMP may be performed to level the top surface of ILD 46 and the top surfaces of the layer stacks, as shown in FIG. 12.

Referring to FIG. 13, the remaining portions of polysilicon layer 38 are removed, for example, through etching, and are replaced with replacement gates. Hence, polysilicon layer 38 is referred to as a dummy gate throughout the description. The replacement gates include metal gate electrodes 152, 252, 352, and 452. Metal gate electrodes 152, 252, 352, and 452 may have a single layer structure or a multi-layer structure including a plurality of layers, which is schematically illustrated using reference notations 148 and 150. Metal gate electrode 152 forms the gate electrode of embedded flash memory 156. Metal gate electrode 252 forms the gate electrode of HV device (transistor) 256. Metal gate electrode 352 forms the gate electrode of IO device (transistor) 356. Metal gate electrode 452 forms the gate electrode of core or SRAM device (transistor) 456. Gate electrodes 152, 252, 352, and 452 may comprise metal or metal alloys such as Cu, W, Co, Ru, Al, TiN, TaN, TaC, combinations thereof, and multi-layers thereof.

In subsequent steps, contact openings (not shown) are formed in ILD 46, exposing underlying source/drain regions 44. Source/drain silicides and sourced/drain contact plugs (not shown) may be formed to electrically couple to source/drain regions 44. The formation of memory device 156, HV transistor 256, IO transistor 356, and core/SRAM transistor 456 is thus finished.

FIGS. 14 through 16 illustrate the formation of devices in regions 100, 200, 300, and 400 using a gate-first approach, as compared to the gate-last approach in FIGS. 8 through 13. In these embodiments, since polysilicon layer 38 form gate electrodes of the resulting devices, polysilicon layer 38 are not removed. The initial steps are essentially the same as in FIGS. 1 through 7. Next, the stacked layers in FIG. 7 are patterned to form gate stacks 140, 240, 340, and 440, as shown in FIG. 14. Gate spacers 42 are then formed on the sidewalls of the gate stacks. Lightly doped source and drain regions (not shown) and pocket regions (not shown) may also be formed at this step. Next as shown in FIG. 15, source/drain regions 44 are formed.

In a subsequent step, as shown in FIG. 16, ILD 46 is formed to cover gate stacks 140, 240, 340, and 440, followed by a CMP step to level the top surface of ILD 46. The top surface of the ILD 46 is higher than the top surfaces of gate stacks 140, 240, 340, and 440 in these embodiments. Next, contact plug openings (occupied by contact plugs 60) are formed by etching ILD 46. Polysilicon layer 38 and source/drain regions 44 are thus exposed. The exposed portions of polysilicon layer 38 and source/drain regions 44 are silicided to from silicide regions 58. The contact plug openings are then filled with a conductive material such as tungsten, aluminum, copper, and/or the like to form contact plugs 60.

In accordance with the embodiments of the present disclosure, as shown in FIGS. 13 and 16, high-k dielectric layer 34 is formed over the top dielectric layer 28 to form the blocking layer of the resulting embedded flash memory device 156. With the dual layer structure of the blocking layer, the thickness of the high-k dielectric and the top dielectric layer may be reduced without sacrificing the charge retention ability of the memory devices. On the other hand, with the formation of the metal gates in the memory device 156, the mismatch between the threshold voltages of different embedded flash memory devices is reduced. This is advantageous for the formation of flash memory devices having different threshold voltage levels in a same chip. With small mismatch, different levels of threshold voltages may be clearly distinguished from each other.

In accordance with some embodiments, an embedded flash memory device includes a gate stack, and source and drain regions in the semiconductor substrate. The first source and drain regions are on opposite sides of the gate stack. The gate stack includes a bottom dielectric layer over the semiconductor substrate, a charge trapping layer over the bottom dielectric layer, a top dielectric layer over the charge trapping layer, a high-k dielectric layer over the top dielectric layer, and a metal gate over the high-k dielectric layer.

In accordance with other embodiments, a device includes an embedded flash memory device and a transistor. The embedded flash memory device includes a first gate stack. The first gate stack includes a bottom silicon oxide layer over the semiconductor substrate, a charge trapping layer over the bottom silicon oxide layer, a top oxide layer over the charge trapping layer, a first high-k dielectric layer over and contacting the top oxide layer, a first metal capping layer over and contacting the first high-k dielectric layer, and a first metal gate over the first metal capping layer. The transistor includes a second gate stack, which includes an oxide layer over the semiconductor substrate, and a second high-k dielectric layer over and contacting the oxide layer. The first and the second high-k dielectric layers have a same thickness, and are formed of a same material. The second gate stack further includes a second metal capping layer over and contacting the second high-k dielectric layer, wherein the first and the second metal capping layers have a same thickness, and are formed of a same material. The second gate stack further includes a second metal gate over the second metal capping layer.

In accordance with yet other embodiments, a method includes forming a stack of layers, which includes forming a bottom dielectric layer over a semiconductor substrate and in a first device region, forming a charge trapping layer over the bottom dielectric layer, forming a top dielectric layer over the charge trapping layer, and forming a first high-k dielectric layer over the top dielectric layer. The method further includes patterning the stack of layers, forming a first metal gate over the stack of layers, and forming first source and drain regions in the semiconductor substrate. The stack of layers, the first metal gate, and the first source and drain regions form portions of an embedded flash memory device.

Although the embodiments and their advantages have been described in detail, it should be understood that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the embodiments as defined by the appended claims. Moreover, the scope of the present application is not intended to be limited to the particular embodiments of the process, machine, manufacture, and composition of matter, means, methods and steps described in the specification. As one of ordinary skill in the art will readily appreciate from the disclosure, processes, machines, manufacture, compositions of matter, means, methods, or steps, presently existing or later to be developed, that perform substantially the same function or achieve substantially the same result as the corresponding embodiments described herein may be utilized according to the disclosure. Accordingly, the appended claims are intended to include within their scope such processes, machines, manufacture, compositions of matter, means, methods, or steps. In addition, each claim constitutes a separate embodiment, and the combination of various claims and embodiments are within the scope of the disclosure. 

What is claimed is:
 1. A device comprising: a semiconductor substrate; a flash memory device comprising: a first gate stack comprising: a bottom dielectric layer over the semiconductor substrate; a charge trapping layer over the bottom dielectric layer; a top dielectric layer disposed directly on the charge trapping layer; a first high-k dielectric layer over the top dielectric layer; and a first metal gate over the first high-k dielectric layer; and a first source and drain regions in the semiconductor substrate, wherein the first source and drain regions are on opposite sides of the first gate stack; and a transistor comprising: a second gate stack comprising: a second dielectric layer disposed directly on the semiconductor substrate; a second high-k dielectric layer over the dielectric layer; and a second metal gate over the second high-k dielectric layer; and a second source and drain regions in the semiconductor substrate, wherein the second source and drain regions are on opposite sides of the second gate stack, wherein the first high-k dielectric layer has the same thickness and is formed of the same material as the second high-k dielectric layer; wherein the top dielectric and second dielectric are the same thickness and material.
 2. The device of claim 1, wherein the top dielectric layer comprises silicon oxide or silicon oxynitride.
 3. The device of claim 1 further comprising a first metal capping layer overlying the first high-k dielectric layer and underlying the first metal gate.
 4. The device of claim 1, wherein the first metal gate and the second metal gate are formed of the same material, and have the same thickness.
 5. The device of claim 1 further comprising: an Input/output (IO) transistor comprising: a third gate stack comprising: an IO dielectric layer over the semiconductor substrate, wherein the IO dielectric layer has a thickness smaller than a thickness of the top dielectric layer; a third high-k dielectric layer over the IO dielectric layer; and a third metal gate over the third high-k dielectric layer; and third source and drain regions in the semiconductor substrate, wherein the third source and drain regions are on opposite sides of the third gate stack.
 6. The device of claim 1 further comprising: a transistor selected from the group consisting of a core transistor and a Static Random Access Memory (SRAM) transistor, the transistor comprising: a fourth gate stack comprising: an interfacial dielectric layer over the semiconductor substrate; a fourth high-k dielectric layer over the interfacial dielectric layer; and a fourth metal gate over the fourth high-k dielectric layer; and fourth source and drain regions in the semiconductor substrate, wherein the fourth source and drain regions are on opposite sides of the fourth gate stack.
 7. The device of claim 1, wherein the first metal gate comprises: a first conductive layer comprising a bottom portion, and sidewall portions over and connected to opposite ends of the bottom portion; and a second conductive layer overlapping the bottom portion, with sidewalls of the second conductive layer contacting sidewalls of the sidewall portions of the first conductive layer.
 8. The device of claim 1, wherein the first high-k dielectric layer physically contacts the first portion of the top dielectric layer and wherein the second high-k dielectric layer physically contacts the second portion of the top dielectric layer.
 9. The device of claim 1, wherein the first metal gate and the second metal gate have a multi-layer structure.
 10. The device of claim 1, wherein the first metal gate has a first thickness, the second metal gate has a second thickness, and the second thickness is larger than the first thickness.
 11. A device comprising: a semiconductor substrate; a flash memory device comprising a first gate stack, wherein the first gate stack comprises: a bottom silicon oxide layer over the semiconductor substrate; a charge trapping layer over the bottom silicon oxide layer; a top oxide layer over and contacting the charge trapping layer; a first high-k dielectric layer over and contacting the top oxide layer; a first metal capping layer over and contacting the first high-k dielectric layer; and a first metal gate over the first metal capping layer; and a transistor comprising a second gate stack, wherein the second gate stack comprises: an oxide layer over and contacting the semiconductor substrate, wherein the top oxide layer and oxide layer are formed of the same material; a second high-k dielectric layer over and contacting the oxide layer, wherein the second high-k dielectric layer has the same thickness and is formed of the same material as the first high-k dielectric layer; a second metal capping layer over and contacting the second high-k dielectric layer, wherein the first and the second metal capping layers have the same thickness, and are formed of the same material; and a second metal gate over the second metal capping layer.
 12. The device of claim 11, wherein the top oxide layer and the oxide layer are silicon oxide layers.
 13. The device of claim 11, wherein the first and the second metal capping layers comprise titanium nitride.
 14. A device comprising: a semiconductor substrate; a flash memory device comprising a first gate stack, wherein the first gate stack comprises: a bottom silicon oxide layer over the semiconductor substrate; a charge trapping layer over the bottom silicon oxide layer; a top oxide layer disposed physically contacting the charge trapping layer; a first high-k dielectric layer over the top oxide layer; a first conductive capping layer over and contacting the first high-k dielectric layer; and a first conductive gate over the first conductive capping layer; and a transistor comprising a second gate stack, wherein the second gate stack comprises: a second top oxide layer disposed physically contacting the semiconductor substrate and underlying a second high-k dielectric layer; the second high-k dielectric layer over the second top oxide layer; wherein the second gate stack further comprises a second conductive capping layer over and contacting the second high-k dielectric layer; and a second conductive gate over the second conductive capping layer; wherein the first high-k dielectric layer and second high-k dielectric layer are of the same material; wherein the first conductive capping layer and the second conductive capping layer are of the same material; wherein the top oxide layer and second top oxide layer are the same thickness and material.
 15. The device of claim 14, wherein the first conductive gate and the second conductive gate comprise metal gates, with top surfaces of the first conductive gate and the second conductive gate being coplanar.
 16. The device of claim 15, wherein the first conductive gate has a height smaller than a height of the second conductive gate.
 17. The device of claim 14, wherein the first conductive gate and the second conductive gate comprise polysilicon gates, with a top surface of the first conductive gate being higher than a top surface of the second conductive gate.
 18. The device of claim 14, wherein the first conductive capping layer comprises titanium or tantalum.
 19. The device of claim 14, wherein the charge trapping layer comprises a dielectric material.
 20. The device of claim 14, wherein the first high-k dielectric layer physically contacts the first top oxide layer. 